Overview
Brought to you by YData
Dataset statistics
| Number of variables | 11 |
|---|---|
| Number of observations | 7497 |
| Missing cells | 2711 |
| Missing cells (%) | 3.3% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 644.4 KiB |
| Average record size in memory | 88.0 B |
Variable types
| Text | 1 |
|---|---|
| Categorical | 5 |
| Numeric | 4 |
| Boolean | 1 |
가격(백만원) is highly overall correlated with 모델 and 2 other fields | High correlation |
구동방식 is highly overall correlated with 모델 and 1 other fields | High correlation |
모델 is highly overall correlated with 가격(백만원) and 2 other fields | High correlation |
배터리용량 is highly overall correlated with 가격(백만원) and 2 other fields | High correlation |
보증기간(년) is highly overall correlated with 연식(년) and 2 other fields | High correlation |
연식(년) is highly overall correlated with 보증기간(년) | High correlation |
제조사 is highly overall correlated with 가격(백만원) and 2 other fields | High correlation |
주행거리(km) is highly overall correlated with 배터리용량 and 2 other fields | High correlation |
차량상태 is highly overall correlated with 배터리용량 and 2 other fields | High correlation |
사고이력 is highly imbalanced (73.2%) | Imbalance |
연식(년) is highly imbalanced (52.7%) | Imbalance |
배터리용량 has 2711 (36.2%) missing values | Missing |
ID has unique values | Unique |
보증기간(년) has 618 (8.2%) zeros | Zeros |
Reproduction
| Analysis started | 2025-01-13 12:08:38.276318 |
|---|---|
| Analysis finished | 2025-01-13 12:08:43.533613 |
| Duration | 5.26 seconds |
| Software version | ydata-profiling v0.0.dev0 |
| Download configuration | config.json |
Variables
ID
Text
Unique 
| Distinct | 7497 |
|---|---|
| Distinct (%) | 100.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.7 KiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 10 |
| Min length | 10 |
Unique
| Unique | 7497 ? |
|---|---|
| Unique (%) | 100.0% |
Sample
| 1st row | TRAIN_0000 |
|---|---|
| 2nd row | TRAIN_0001 |
| 3rd row | TRAIN_0002 |
| 4th row | TRAIN_0003 |
| 5th row | TRAIN_0004 |
| Value | Count | Frequency (%) |
| train_0000 | 1 | < 0.1% |
| train_0015 | 1 | < 0.1% |
| train_0003 | 1 | < 0.1% |
| train_0004 | 1 | < 0.1% |
| train_0005 | 1 | < 0.1% |
| train_0006 | 1 | < 0.1% |
| train_0007 | 1 | < 0.1% |
| train_0008 | 1 | < 0.1% |
| train_0009 | 1 | < 0.1% |
| train_0010 | 1 | < 0.1% |
| Other values (7487) | 7487 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 7497 | |
| R | 7497 | |
| A | 7497 | |
| I | 7497 | |
| N | 7497 | |
| _ | 7497 | |
| 0 | 3300 | 4.4% |
| 3 | 3300 | 4.4% |
| 2 | 3300 | 4.4% |
| 1 | 3300 | 4.4% |
| Other values (6) | 16788 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 37485 | |
| Decimal Number | 29988 | |
| Connector Punctuation | 7497 | 10.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 3300 | |
| 3 | 3300 | |
| 2 | 3300 | |
| 1 | 3300 | |
| 4 | 3297 | |
| 5 | 3200 | |
| 6 | 3200 | |
| 7 | 2696 | |
| 8 | 2199 | |
| 9 | 2196 |
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 7497 | |
| R | 7497 | |
| A | 7497 | |
| I | 7497 | |
| N | 7497 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 7497 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 37485 | |
| Common | 37485 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 7497 | |
| 0 | 3300 | |
| 3 | 3300 | |
| 2 | 3300 | |
| 1 | 3300 | |
| 4 | 3297 | |
| 5 | 3200 | |
| 6 | 3200 | |
| 7 | 2696 | 7.2% |
| 8 | 2199 | 5.9% |
Latin
| Value | Count | Frequency (%) |
| T | 7497 | |
| R | 7497 | |
| A | 7497 | |
| I | 7497 | |
| N | 7497 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 74970 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 7497 | |
| R | 7497 | |
| A | 7497 | |
| I | 7497 | |
| N | 7497 | |
| _ | 7497 | |
| 0 | 3300 | 4.4% |
| 3 | 3300 | 4.4% |
| 2 | 3300 | 4.4% |
| 1 | 3300 | 4.4% |
| Other values (6) | 16788 |
제조사
Categorical
High correlation 
| Distinct | 7 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.7 KiB |
| H사 | |
|---|---|
| B사 | |
| K사 | |
| A사 | |
| T사 | |
| Other values (2) |
Length
| Max length | 2 |
|---|---|
| Median length | 2 |
| Mean length | 2 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | P사 |
|---|---|
| 2nd row | K사 |
| 3rd row | A사 |
| 4th row | A사 |
| 5th row | B사 |
Common Values
| Value | Count | Frequency (%) |
| H사 | 1237 | |
| B사 | 1169 | |
| K사 | 1164 | |
| A사 | 1142 | |
| T사 | 1109 | |
| P사 | 1071 | |
| V사 | 605 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| h사 | 1237 | |
| b사 | 1169 | |
| k사 | 1164 | |
| a사 | 1142 | |
| t사 | 1109 | |
| p사 | 1071 | |
| v사 | 605 |
Most occurring characters
| Value | Count | Frequency (%) |
| 사 | 7497 | |
| H | 1237 | 8.2% |
| B | 1169 | 7.8% |
| K | 1164 | 7.8% |
| A | 1142 | 7.6% |
| T | 1109 | 7.4% |
| P | 1071 | 7.1% |
| V | 605 | 4.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Other Letter | 7497 | |
| Uppercase Letter | 7497 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| H | 1237 | |
| B | 1169 | |
| K | 1164 | |
| A | 1142 | |
| T | 1109 | |
| P | 1071 | |
| V | 605 |
Other Letter
| Value | Count | Frequency (%) |
| 사 | 7497 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Hangul | 7497 | |
| Latin | 7497 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| H | 1237 | |
| B | 1169 | |
| K | 1164 | |
| A | 1142 | |
| T | 1109 | |
| P | 1071 | |
| V | 605 |
Hangul
| Value | Count | Frequency (%) |
| 사 | 7497 |
Most occurring blocks
| Value | Count | Frequency (%) |
| Hangul | 7497 | |
| ASCII | 7497 |
Most frequent character per block
Hangul
| Value | Count | Frequency (%) |
| 사 | 7497 |
ASCII
| Value | Count | Frequency (%) |
| H | 1237 | |
| B | 1169 | |
| K | 1164 | |
| A | 1142 | |
| T | 1109 | |
| P | 1071 | |
| V | 605 |
모델
Categorical
High correlation 
| Distinct | 21 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.7 KiB |
| ID4 | |
|---|---|
| i5 | 414 |
| Niro | 398 |
| Soul | 397 |
| i3 | 388 |
| Other values (16) |
Length
| Max length | 6 |
|---|---|
| Median length | 5 |
| Mean length | 3.3305322 |
| Min length | 2 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | TayGTS |
|---|---|
| 2nd row | Niro |
| 3rd row | eT |
| 4th row | RSeTGT |
| 5th row | i5 |
Common Values
| Value | Count | Frequency (%) |
| ID4 | 605 | 8.1% |
| i5 | 414 | 5.5% |
| Niro | 398 | 5.3% |
| Soul | 397 | 5.3% |
| i3 | 388 | 5.2% |
| RSeTGT | 385 | 5.1% |
| eT | 379 | 5.1% |
| ION6 | 379 | 5.1% |
| Q4eT | 378 | 5.0% |
| TayGTS | 375 | 5.0% |
| Other values (11) | 3399 |
Length
| Value | Count | Frequency (%) |
| id4 | 605 | 8.1% |
| i5 | 414 | 5.5% |
| niro | 398 | 5.3% |
| soul | 397 | 5.3% |
| i3 | 388 | 5.2% |
| rsetgt | 385 | 5.1% |
| et | 379 | 5.1% |
| ion6 | 379 | 5.1% |
| q4et | 378 | 5.0% |
| taygts | 375 | 5.0% |
| Other values (11) | 3399 |
Most occurring characters
| Value | Count | Frequency (%) |
| T | 3308 | 13.2% |
| N | 1635 | 6.5% |
| I | 1617 | 6.5% |
| i | 1567 | 6.3% |
| S | 1434 | 5.7% |
| e | 1142 | 4.6% |
| M | 1109 | 4.4% |
| y | 1071 | 4.3% |
| a | 1071 | 4.3% |
| 4 | 983 | 3.9% |
| Other values (18) | 10032 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 14966 | |
| Lowercase Letter | 6838 | |
| Decimal Number | 3165 | 12.7% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| T | 3308 | |
| N | 1635 | |
| I | 1617 | |
| S | 1434 | |
| M | 1109 | 7.4% |
| O | 872 | 5.8% |
| G | 760 | 5.1% |
| E | 734 | 4.9% |
| X | 631 | 4.2% |
| D | 605 | 4.0% |
| Other values (6) | 2261 |
Lowercase Letter
| Value | Count | Frequency (%) |
| i | 1567 | |
| e | 1142 | |
| y | 1071 | |
| a | 1071 | |
| o | 795 | |
| r | 398 | 5.8% |
| l | 397 | 5.8% |
| u | 397 | 5.8% |
Decimal Number
| Value | Count | Frequency (%) |
| 4 | 983 | |
| 5 | 767 | |
| 6 | 748 | |
| 3 | 667 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 21804 | |
| Common | 3165 | 12.7% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| T | 3308 | |
| N | 1635 | 7.5% |
| I | 1617 | 7.4% |
| i | 1567 | 7.2% |
| S | 1434 | 6.6% |
| e | 1142 | 5.2% |
| M | 1109 | 5.1% |
| y | 1071 | 4.9% |
| a | 1071 | 4.9% |
| O | 872 | 4.0% |
| Other values (14) | 6978 |
Common
| Value | Count | Frequency (%) |
| 4 | 983 | |
| 5 | 767 | |
| 6 | 748 | |
| 3 | 667 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 24969 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| T | 3308 | 13.2% |
| N | 1635 | 6.5% |
| I | 1617 | 6.5% |
| i | 1567 | 6.3% |
| S | 1434 | 5.7% |
| e | 1142 | 4.6% |
| M | 1109 | 4.4% |
| y | 1071 | 4.3% |
| a | 1071 | 4.3% |
| 4 | 983 | 3.9% |
| Other values (18) | 10032 |
차량상태
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.7 KiB |
| Brand New | |
|---|---|
| Nearly New | |
| Pre-Owned |
Length
| Max length | 10 |
|---|---|
| Median length | 9 |
| Mean length | 9.2746432 |
| Min length | 9 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | Nearly New |
|---|---|
| 2nd row | Nearly New |
| 3rd row | Brand New |
| 4th row | Nearly New |
| 5th row | Pre-Owned |
Common Values
| Value | Count | Frequency (%) |
| Brand New | 3380 | |
| Nearly New | 2059 | |
| Pre-Owned | 2058 |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| new | 5439 | |
| brand | 3380 | |
| nearly | 2059 | 15.9% |
| pre-owned | 2058 | 15.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| e | 11614 | |
| N | 7498 | |
| r | 7497 | |
| w | 7497 | |
| a | 5439 | |
| 5439 | ||
| n | 5438 | |
| d | 5438 | |
| B | 3380 | 4.9% |
| l | 2059 | 3.0% |
| Other values (4) | 8233 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 47041 | |
| Uppercase Letter | 14994 | 21.6% |
| Space Separator | 5439 | 7.8% |
| Dash Punctuation | 2058 | 3.0% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| e | 11614 | |
| r | 7497 | |
| w | 7497 | |
| a | 5439 | |
| n | 5438 | |
| d | 5438 | |
| l | 2059 | 4.4% |
| y | 2059 | 4.4% |
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 7498 | |
| B | 3380 | |
| P | 2058 | 13.7% |
| O | 2058 | 13.7% |
Space Separator
| Value | Count | Frequency (%) |
| 5439 |
Dash Punctuation
| Value | Count | Frequency (%) |
| - | 2058 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 62035 | |
| Common | 7497 | 10.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| e | 11614 | |
| N | 7498 | |
| r | 7497 | |
| w | 7497 | |
| a | 5439 | |
| n | 5438 | |
| d | 5438 | |
| B | 3380 | 5.4% |
| l | 2059 | 3.3% |
| y | 2059 | 3.3% |
| Other values (2) | 4116 | 6.6% |
Common
| Value | Count | Frequency (%) |
| 5439 | ||
| - | 2058 | 27.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 69532 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| e | 11614 | |
| N | 7498 | |
| r | 7497 | |
| w | 7497 | |
| a | 5439 | |
| 5439 | ||
| n | 5438 | |
| d | 5438 | |
| B | 3380 | 4.9% |
| l | 2059 | 3.0% |
| Other values (4) | 8233 |
배터리용량
Real number (ℝ)
High correlation  Missing 
| Distinct | 194 |
|---|---|
| Distinct (%) | 4.1% |
| Missing | 2711 |
| Missing (%) | 36.2% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 69.397187 |
| Minimum | 46 |
|---|---|
| Maximum | 99.8 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 46 |
|---|---|
| 5-th percentile | 46.169 |
| Q1 | 56.359 |
| median | 68.125 |
| Q3 | 78.227 |
| 95-th percentile | 96 |
| Maximum | 99.8 |
| Range | 53.8 |
| Interquartile range (IQR) | 21.868 |
Descriptive statistics
| Standard deviation | 15.283635 |
|---|---|
| Coefficient of variation (CV) | 0.22023422 |
| Kurtosis | -0.96354826 |
| Mean | 69.397187 |
| Median Absolute Deviation (MAD) | 11.502 |
| Skewness | 0.39214255 |
| Sum | 332134.94 |
| Variance | 233.58951 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 90 | 559 | 7.5% |
| 56 | 327 | 4.4% |
| 46 | 223 | 3.0% |
| 68.488 | 202 | 2.7% |
| 76.093 | 186 | 2.5% |
| 96 | 136 | 1.8% |
| 99.8 | 116 | 1.5% |
| 91.2 | 91 | 1.2% |
| 46.169 | 88 | 1.2% |
| 93.4 | 87 | 1.2% |
| Other values (184) | 2771 | |
| (Missing) | 2711 |
| Value | Count | Frequency (%) |
| 46 | 223 | |
| 46.09 | 1 | < 0.1% |
| 46.13 | 1 | < 0.1% |
| 46.15 | 2 | < 0.1% |
| 46.169 | 88 | 1.2% |
| 46.21 | 1 | < 0.1% |
| 46.26 | 1 | < 0.1% |
| 46.34 | 1 | < 0.1% |
| 46.42 | 1 | < 0.1% |
| 46.93 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 99.8 | 116 | 1.5% |
| 96 | 136 | 1.8% |
| 95 | 83 | 1.1% |
| 93.4 | 87 | 1.2% |
| 92.16 | 40 | 0.5% |
| 91.2 | 91 | 1.2% |
| 90 | 559 | |
| 88.474 | 7 | 0.1% |
| 88.08 | 1 | < 0.1% |
| 87.552 | 7 | 0.1% |
구동방식
Categorical
High correlation 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.7 KiB |
| AWD | |
|---|---|
| FWD | |
| RWD |
Length
| Max length | 3 |
|---|---|
| Median length | 3 |
| Mean length | 3 |
| Min length | 3 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | AWD |
|---|---|
| 2nd row | FWD |
| 3rd row | AWD |
| 4th row | AWD |
| 5th row | AWD |
Common Values
| Value | Count | Frequency (%) |
| AWD | 5167 | |
| FWD | 1267 | 16.9% |
| RWD | 1063 | 14.2% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| awd | 5167 | |
| fwd | 1267 | 16.9% |
| rwd | 1063 | 14.2% |
Most occurring characters
| Value | Count | Frequency (%) |
| W | 7497 | |
| D | 7497 | |
| A | 5167 | |
| F | 1267 | 5.6% |
| R | 1063 | 4.7% |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 22491 |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| W | 7497 | |
| D | 7497 | |
| A | 5167 | |
| F | 1267 | 5.6% |
| R | 1063 | 4.7% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 22491 |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| W | 7497 | |
| D | 7497 | |
| A | 5167 | |
| F | 1267 | 5.6% |
| R | 1063 | 4.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 22491 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| W | 7497 | |
| D | 7497 | |
| A | 5167 | |
| F | 1267 | 5.6% |
| R | 1063 | 4.7% |
주행거리(km)
Real number (ℝ)
High correlation 
| Distinct | 6916 |
|---|---|
| Distinct (%) | 92.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 44287.979 |
| Minimum | 3 |
|---|---|
| Maximum | 199827 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 3 |
|---|---|
| 5-th percentile | 1043.8 |
| Q1 | 5465 |
| median | 17331 |
| Q3 | 61252 |
| 95-th percentile | 174270.2 |
| Maximum | 199827 |
| Range | 199824 |
| Interquartile range (IQR) | 55787 |
Descriptive statistics
| Standard deviation | 55204.064 |
|---|---|
| Coefficient of variation (CV) | 1.2464796 |
| Kurtosis | 0.69888503 |
| Mean | 44287.979 |
| Median Absolute Deviation (MAD) | 15061 |
| Skewness | 1.3929303 |
| Sum | 3.3202698 × 108 |
| Variance | 3.0474887 × 109 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 7233 | 4 | 0.1% |
| 3012 | 4 | 0.1% |
| 1631 | 4 | 0.1% |
| 978 | 4 | 0.1% |
| 9396 | 3 | < 0.1% |
| 2571 | 3 | < 0.1% |
| 9413 | 3 | < 0.1% |
| 3861 | 3 | < 0.1% |
| 38158 | 3 | < 0.1% |
| 5078 | 3 | < 0.1% |
| Other values (6906) | 7463 |
| Value | Count | Frequency (%) |
| 3 | 1 | |
| 4 | 2 | |
| 6 | 1 | |
| 15 | 2 | |
| 16 | 1 | |
| 26 | 2 | |
| 30 | 1 | |
| 31 | 1 | |
| 32 | 2 | |
| 33 | 1 |
| Value | Count | Frequency (%) |
| 199827 | 1 | |
| 199819 | 1 | |
| 199818 | 1 | |
| 199766 | 1 | |
| 199760 | 1 | |
| 199647 | 1 | |
| 199515 | 1 | |
| 199457 | 1 | |
| 199384 | 1 | |
| 199329 | 1 |
보증기간(년)
Real number (ℝ)
High correlation  Zeros 
| Distinct | 11 |
|---|---|
| Distinct (%) | 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 4.9609177 |
| Minimum | 0 |
|---|---|
| Maximum | 10 |
| Zeros | 618 |
| Zeros (%) | 8.2% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0 |
| Q1 | 2 |
| median | 5 |
| Q3 | 8 |
| 95-th percentile | 10 |
| Maximum | 10 |
| Range | 10 |
| Interquartile range (IQR) | 6 |
Descriptive statistics
| Standard deviation | 3.155342 |
|---|---|
| Coefficient of variation (CV) | 0.63603998 |
| Kurtosis | -1.3757981 |
| Mean | 4.9609177 |
| Median Absolute Deviation (MAD) | 3 |
| Skewness | -0.036224811 |
| Sum | 37192 |
| Variance | 9.9561831 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 2 | 1358 | |
| 7 | 1093 | |
| 8 | 1073 | |
| 0 | 618 | |
| 1 | 552 | |
| 10 | 522 | 7.0% |
| 9 | 515 | 6.9% |
| 3 | 494 | 6.6% |
| 5 | 428 | 5.7% |
| 4 | 426 | 5.7% |
| Value | Count | Frequency (%) |
| 0 | 618 | |
| 1 | 552 | |
| 2 | 1358 | |
| 3 | 494 | 6.6% |
| 4 | 426 | 5.7% |
| 5 | 428 | 5.7% |
| 6 | 418 | 5.6% |
| 7 | 1093 | |
| 8 | 1073 | |
| 9 | 515 | 6.9% |
| Value | Count | Frequency (%) |
| 10 | 522 | 7.0% |
| 9 | 515 | 6.9% |
| 8 | 1073 | |
| 7 | 1093 | |
| 6 | 418 | 5.6% |
| 5 | 428 | 5.7% |
| 4 | 426 | 5.7% |
| 3 | 494 | 6.6% |
| 2 | 1358 | |
| 1 | 552 |
사고이력
Boolean
Imbalance 
| Distinct | 2 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 7.5 KiB |
| False | |
|---|---|
| True | 343 |
| Value | Count | Frequency (%) |
| False | 7154 | |
| True | 343 | 4.6% |
연식(년)
Categorical
High correlation  Imbalance 
| Distinct | 3 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 58.7 KiB |
| 0 | |
|---|---|
| 2 | 566 |
| 1 | 536 |
Length
| Max length | 1 |
|---|---|
| Median length | 1 |
| Mean length | 1 |
| Min length | 1 |
Unique
| Unique | 0 ? |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | 2 |
|---|---|
| 2nd row | 0 |
| 3rd row | 0 |
| 4th row | 0 |
| 5th row | 0 |
Common Values
| Value | Count | Frequency (%) |
| 0 | 6395 | |
| 2 | 566 | 7.5% |
| 1 | 536 | 7.1% |
Length
Common Values (Plot)
| Value | Count | Frequency (%) |
| 0 | 6395 | |
| 2 | 566 | 7.5% |
| 1 | 536 | 7.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| 0 | 6395 | |
| 2 | 566 | 7.5% |
| 1 | 536 | 7.1% |
Most occurring categories
| Value | Count | Frequency (%) |
| Decimal Number | 7497 |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 0 | 6395 | |
| 2 | 566 | 7.5% |
| 1 | 536 | 7.1% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Common | 7497 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| 0 | 6395 | |
| 2 | 566 | 7.5% |
| 1 | 536 | 7.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 7497 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| 0 | 6395 | |
| 2 | 566 | 7.5% |
| 1 | 536 | 7.1% |
가격(백만원)
Real number (ℝ)
High correlation 
| Distinct | 3950 |
|---|---|
| Distinct (%) | 52.7% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 62.331949 |
| Minimum | 9 |
|---|---|
| Maximum | 161.09 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 58.7 KiB |
Quantile statistics
| Minimum | 9 |
|---|---|
| 5-th percentile | 22.29 |
| Q1 | 34.39 |
| median | 56 |
| Q3 | 80.05 |
| 95-th percentile | 135.8 |
| Maximum | 161.09 |
| Range | 152.09 |
| Interquartile range (IQR) | 45.66 |
Descriptive statistics
| Standard deviation | 36.646759 |
|---|---|
| Coefficient of variation (CV) | 0.58792898 |
| Kurtosis | 0.35800952 |
| Mean | 62.331949 |
| Median Absolute Deviation (MAD) | 23.1 |
| Skewness | 1.0033363 |
| Sum | 467302.62 |
| Variance | 1342.985 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 160 | 48 | 0.6% |
| 60 | 45 | 0.6% |
| 100 | 43 | 0.6% |
| 39 | 39 | 0.5% |
| 24 | 36 | 0.5% |
| 35 | 34 | 0.5% |
| 38 | 33 | 0.4% |
| 36 | 30 | 0.4% |
| 23.5 | 25 | 0.3% |
| 99 | 22 | 0.3% |
| Other values (3940) | 7142 |
| Value | Count | Frequency (%) |
| 9 | 3 | |
| 9.38 | 1 | < 0.1% |
| 9.66 | 1 | < 0.1% |
| 9.77 | 1 | < 0.1% |
| 9.83 | 1 | < 0.1% |
| 9.92 | 1 | < 0.1% |
| 9.94 | 1 | < 0.1% |
| 10.22 | 1 | < 0.1% |
| 10.46 | 1 | < 0.1% |
| 10.85 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 161.09 | 1 | < 0.1% |
| 161.01 | 1 | < 0.1% |
| 160.99 | 2 | |
| 160.96 | 2 | |
| 160.95 | 1 | < 0.1% |
| 160.94 | 1 | < 0.1% |
| 160.91 | 1 | < 0.1% |
| 160.87 | 2 | |
| 160.86 | 1 | < 0.1% |
| 160.84 | 3 |
Interactions
Correlations
| 가격(백만원) | 구동방식 | 모델 | 배터리용량 | 보증기간(년) | 사고이력 | 연식(년) | 제조사 | 주행거리(km) | 차량상태 | |
|---|---|---|---|---|---|---|---|---|---|---|
| 가격(백만원) | 1.000 | 0.431 | 0.792 | 0.581 | -0.269 | 0.000 | 0.113 | 0.595 | -0.108 | 0.189 |
| 구동방식 | 0.431 | 1.000 | 0.850 | 0.305 | 0.304 | 0.000 | 0.041 | 0.677 | 0.061 | 0.026 |
| 모델 | 0.792 | 0.850 | 1.000 | 0.466 | 0.381 | 0.000 | 0.167 | 0.999 | 0.146 | 0.306 |
| 배터리용량 | 0.581 | 0.305 | 0.466 | 1.000 | 0.487 | 0.000 | 0.346 | 0.376 | -0.661 | 0.768 |
| 보증기간(년) | -0.269 | 0.304 | 0.381 | 0.487 | 1.000 | 0.000 | 0.582 | 0.389 | -0.707 | 0.758 |
| 사고이력 | 0.000 | 0.000 | 0.000 | 0.000 | 0.000 | 1.000 | 0.010 | 0.000 | 0.026 | 0.000 |
| 연식(년) | 0.113 | 0.041 | 0.167 | 0.346 | 0.582 | 0.010 | 1.000 | 0.037 | 0.363 | 0.450 |
| 제조사 | 0.595 | 0.677 | 0.999 | 0.376 | 0.389 | 0.000 | 0.037 | 1.000 | 0.034 | 0.008 |
| 주행거리(km) | -0.108 | 0.061 | 0.146 | -0.661 | -0.707 | 0.026 | 0.363 | 0.034 | 1.000 | 0.867 |
| 차량상태 | 0.189 | 0.026 | 0.306 | 0.768 | 0.758 | 0.000 | 0.450 | 0.008 | 0.867 | 1.000 |
Missing values
Sample
| ID | 제조사 | 모델 | 차량상태 | 배터리용량 | 구동방식 | 주행거리(km) | 보증기간(년) | 사고이력 | 연식(년) | 가격(백만원) | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | TRAIN_0000 | P사 | TayGTS | Nearly New | 86.077 | AWD | 13642 | 0 | No | 2 | 159.66 |
| 1 | TRAIN_0001 | K사 | Niro | Nearly New | 56.000 | FWD | 10199 | 6 | No | 0 | 28.01 |
| 2 | TRAIN_0002 | A사 | eT | Brand New | 91.200 | AWD | 2361 | 7 | No | 0 | 66.27 |
| 3 | TRAIN_0003 | A사 | RSeTGT | Nearly New | NaN | AWD | 21683 | 3 | No | 0 | 99.16 |
| 4 | TRAIN_0004 | B사 | i5 | Pre-Owned | 61.018 | AWD | 178205 | 1 | No | 0 | 62.02 |
| 5 | TRAIN_0005 | H사 | ION6 | Pre-Owned | 58.162 | AWD | 103100 | 3 | No | 0 | 37.02 |
| 6 | TRAIN_0006 | T사 | MS | Nearly New | NaN | AWD | 19395 | 3 | No | 0 | 83.42 |
| 7 | TRAIN_0007 | A사 | RSeTGT | Nearly New | 78.227 | AWD | 30583 | 5 | No | 1 | 99.66 |
| 8 | TRAIN_0008 | T사 | MY | Brand New | NaN | AWD | 2226 | 8 | No | 0 | 74.06 |
| 9 | TRAIN_0009 | A사 | Q4eT | Brand New | NaN | AWD | 3683 | 7 | No | 0 | 59.66 |
| ID | 제조사 | 모델 | 차량상태 | 배터리용량 | 구동방식 | 주행거리(km) | 보증기간(년) | 사고이력 | 연식(년) | 가격(백만원) | |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 7487 | TRAIN_7487 | H사 | IONIQ | Nearly New | 67.17 | FWD | 29028 | 3 | No | 1 | 11.39 |
| 7488 | TRAIN_7488 | T사 | M3 | Brand New | NaN | RWD | 3839 | 7 | Yes | 0 | 46.46 |
| 7489 | TRAIN_7489 | H사 | ION5 | Brand New | NaN | AWD | 8871 | 9 | No | 0 | 35.83 |
| 7490 | TRAIN_7490 | A사 | Q4eT | Brand New | NaN | AWD | 5794 | 7 | No | 0 | 59.95 |
| 7491 | TRAIN_7491 | K사 | Soul | Brand New | NaN | FWD | 5966 | 10 | No | 0 | 16.75 |
| 7492 | TRAIN_7492 | H사 | ION5 | Brand New | NaN | AWD | 3773 | 10 | No | 0 | 35.95 |
| 7493 | TRAIN_7493 | B사 | i3 | Pre-Owned | 46.00 | RWD | 135411 | 2 | No | 0 | 23.40 |
| 7494 | TRAIN_7494 | P사 | TayCT | Brand New | NaN | AWD | 1363 | 2 | No | 0 | 120.00 |
| 7495 | TRAIN_7495 | B사 | i3 | Nearly New | 56.00 | RWD | 39445 | 6 | No | 2 | 24.00 |
| 7496 | TRAIN_7496 | T사 | MY | Pre-Owned | 51.94 | AWD | 80215 | 0 | No | 0 | 74.06 |